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ABSTRACT 

The  importance  of  reversal  complexity  as  a  basic  computational  resource 
has  only  been  recognized  in  recent  years.  It  is  intimately  connected  to 
parallel  time  complexity  and  circuit  depth.  In  this  paper,  we  develop 
some  basic  techniques  necessary  for  establishing  analogues  of  well- 
known  theorems  on  space  and  time  complexity.  Our  main  results  are 
that  for  reversal-constructible  functions  j(n)  ^  logn, 

DSPACE{s{n))  C  DREVERSAL{s{n)) 

and  the  first  tape-reduction  theorem.  As  apphcations  of  the  tape 
reduction  theorem,  we  prove  a  hierarchy  theorem  and  show  the 
existence  of  complete  languages  for  reversal  complexity. 
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1.    Introduction 

The  number  of  reversals  made  by  tape  heads  during  a  Turing  machine  computation 
has  assumed  new  importance  as  a  complexity  measure  in  complexity  theory.  This  is 
because  reversal  complexity  is  intimately  connected  with  uniform  circuit  depth  and 
parallel  time.  For  example,  Hong  [7]  showed  that  reversal  in  sequential  machine  models 
(including  the  standard  Turing  machines)  corresponds  to  parallel  time  in  parallel  machine 
models.  Also  Pippenger  [9]  show  that  simultaneous  time  and  reversal  in  Turing  machines 
are  polynomially  related  to  simultaneous  size  and  depth  in  uniform  circuits.  However, 
reversal  complexity  has  some  unexpected  properties  which,  until  recently,  made 
researchers  treat  it  as  a  curiosity  rather  than  as  a  fundamental  computational  resource. 
Baker  and  Book  [1]  showed  the  surprising  fact  that  every  recursively  enumerable  set  can 
be  recognized  by  a  nondeterministic  Turing  machine  making  at  most  two  tape  reversals. 
Moreover,  unlike  time  complexity  and  space  complexity  which  have  nice  properties  such 
as  'linear  speedup'  and  'tape  reduction',  reversal  complexity  has  defied  attempts  at 
finding  similar  theorems  in  the  multitape  Turing  machine  model. 

Hartmanis  [5]  was  the  first  to  study  reversal  complexity.  He  considered  on-line 
one-tape  Turing  machines  and  showed  that  for  such  machines,  reversal  and  space  are 
polynomially  related.  Moreover,  he  pointed  out  that  for  slow-growing  functions,  the 
'linear  speedup  theorem'  for  reversal  complexity  does  not  hold.  Fisher  [4]  gave  the 
'linear  speedup  theorem'  for  reversals  in  off-line  one-tape  Turing  machines  using  the  idea 
of  'crossing  sequences'.  However,  the  technique  of  Fisher  does  not  generalize  to 
multitape  Turing  machine  models. 

By  a  'tape  reduction  theorem'  for  reversal  complexity  we  mean  a  result  showing 
that,  for  some  slow-growing  function  /(n),  and  for  some  constant  t  ^  1,  any  multitape 
Turing  machine  using  r(n)  reversals  can  be  simulated  by  a  /t-tape  Turing  machine  using 
/('■('»))  reversals.  Call  this  a  (/(n),  k)-tape  reduction  theorem.  For  instance,  the  famous 
Hennie-Steams  [6]  simulation  achieves  a  (nlogn,  2)-tapc  reduction  theorem  for  time 
complexity.  There  is  also  a  (n,  l)-tape  reduction  result  for  space  complexity.  However, 
tape    reduction    theorem    for    reversal    complexity   seems   to    have   withstood    previous 


attempts,  and  the  only  published  indication  of  this  is  a  paper  by  Kameda  and  Vollmar  [8]. 
In  that  paper,  they  showed  that  any  r(n)  reversal-bounded  i-tape  Turing  machine 
without  stationary  moves  can  be  simulated  by  a  2-tape  Turing  machine  making  at  most 
6r(/i)  reversals.  Unfortunately,  it  appears  that  such  machines  are  too  restrictive  from  the 
viewpoint  of  reversal  complexity;  in  any  case  their  techniques  do  not  generalize.  One  of 
the  main  results  in  this  paper  will  be  a  (n-,  2)-tape  reduction  theorem  for  reversal 
complexity,  applicable  to  unrestricted  multitape  Turing  machines. 

We  give  some  definitions.  Our  machine  model  is  the  multitape  Turing  machine. 
Wc  follow  Yap  [13]  in  the  following  conventions:  by  a  '*-tape  Turing  machines',  we 
mean  a  luring  machine  with  a  2-way  read-only  input  tape  and  k  work  tapes. 

D  spacers)  = 

the  class  of  languages  accepted  by  deterministic  Turing  machines  that  use  at  most 
s{n)  tape  squares  of  the'work  tapes  on  inputs  of  length  n. 

NSPACE{s)  = 

the  class  of  languages  accepted  by  nondeterministic  Turing  machines  that  use  at 
most  s(n)  tape  squares  of  the  work  tapes  on  inputs  of  length  n. 

REMARK:  when  we  say  a  machine  M  'accepts'  L  in  space  s{n),  we  mean  that  for  all 
inputs  X  of  sufficiently  large  length,  if  x  is  in  L  then  some  path  accepts  after  using  at  most 
j([r|)  space  but  if  x  is  not  in  L,  then  no  bounds  is  imposed  on  the  space  used  on  any  path. 
This  convention  is  used  for  the  other  resources  as  well. 

DREVERSAL,(r)  = 

the  class  of  languages  accepted  by  t-tapc  deterministic  Turing  machines  which 
makes  at  most  0(r{n))  tape  reversals  on  inputs  of  length  n.  It  should  be  observed 
that  reversals  made  on  the  input  tape  is  counted. 

REMARK:  The  crucial  point  to  note  in  this  definition  is  that  we  use  '0(r(n))'  rather 
than  V(n)'.  This  is  because  we  do  not  have  a  linear  speedup  theorem  on  multitape 
Turing  machines  for  reversal  complexity. 

DREVERSAL{r)     =     KJ  ^DREVERSAL^^(,r) 

A  function  /(n)  is  said  to  be  g{n)-reversal  constructible  if  there  is  a  2-tape  Turing 
machine  that,  given  an  integer  n>0  in  unary  form,  produces  the  unary  form  of /(n)  on 
one  of  its  work  tapes  making  at  most  0(g{n))  tape  reversals.  If  /(n)  is  /(n)-reversal 
constructible,  we  say/(n)  is  reversal  constructible. 

In  section  2,  we  present  useful  techniques  for  efficiently  using  tape  reversals  on 
Turing  machines;  in  sections  3,  we  show  one  of  our  main  results,  namely, 
DSPACE{s{n))  Q  DREVERSAL(s(n)),  in  section  4,  wc  discuss  the  relationship  between 
space  complexity  and  reversal  complexity.  We  give  a  tape  reduction  theorem  for  general 
multitape  Turing  machines  in  section  5.  Finally,  we  give  some  applications  of  tape- 
reduction  result  in  sections  6  and  7  :  in  particular,  wc  show  that  most  common  reversal 


complexity  classes  have  complete  languages  and  that  hierarchy  theorems  exist  for  reversal 
complexity  classes. 


2.    Efficient  Use  of  Reversals 

Reversal  complexity  is  very  different  from  time  and  space  complexity.  Reversal 
complexity  seems  much  more  'powerful'  than  time  and  space  complexity.  For  example, 
only  rwo  reversals  on  a  2-tape  Turing  machine  are  enough  to  double  an  arbitrarily  large 
tape  segment.  In  any  case,  our  intuitions  about  reversal  complexity  are  not  as  well- 
developed  as  in  the  case  of  space  and  time.  Here  we  present  some  techniques  for 
efficiently  using  reversals  on  Turing    machines. 

Lemma  1.  (Natural  Number  Generation)  Given  an  integer  r>0  in  unary,  the  string 
S^  =  0#1#2#  ■  ■  •  #2"-l#  can  be  generated  by  a  2-tape  Turing  machine  M  making 
0{r)  reversals.  Here  m  denotes  the  binary  representation  of  the  integer  m  of  length  r, 
prefixed  with  O's  if  necessary. 

Proof.  M  first  generates  the  pattern  P^  =  (0'^#)'  on  one  of  its  tapes,  say  7,.  This 
can  be  done  within  0{r)  reversals.  Using  P^,  then  M  generates  (01)-^'  on  its  second  tape 
Tj,  making  a  constant  number  of  reversals. 

Now  Af  uses  r  phases,  each  making  a  constant  number  of  reversals,  to  fix  the  r  bits 
in  each  segment  of  zeroes  separated  by  #  in  P^.  Suppose  that  at  the  beginning  of  the 
(i+  1)^'  ((  2:  0)  phase,  M  has  fixed  the  first  i  bits  for  each  segment  of  P^  on  tape  7",,  and 

suppose  the  string  (0-1')'        is  inductively  available  on  tape  T^-    Here  the  first  bit  of  a 
segment  refers  to  its  least  significant  bit.    Note  that  the  (j  +  1)"  bit  of  the  /''  segment  is 

exactly  the  /*  bit  of  string  (O^'l^V       .    Thus  with  only  one  sweep  M  can  fix  the  [i+iy 
bit  for  each  segment  of  P ^  on  tape  T,. 

Note  that  in  the  (I'+l)"  phase  M  also  need  to  know  which  bit  in  the  /''  segment  is 
the  (/+!)"  bit.  This  is  easily  solved,  for  example,  by  placing  a  special  mark  on  another 
track  below  the  i'''  bit  of  each  segment.   These  marks  are  easily  updated  at  each  phase. 

Using  a  constant  number  of  sweeps,  M  can  generate  the  string  (0-'  'l-'"')'  from 
string  (0^1^)  on  tape  I,.    It  is  easy  to  check  that  at  the  end  of  the  r'^  phase,  the 

string  5,  =  0#r#2#    •  ■  •   I'-l*  is  on  tape  T",  of  M.    M  will  terminate  when  it  finds 
that  all  the  bits  have  been  fixed. 

This  completes  the  proof.   Q.E.D. 


A  string  of  the  form  x,#x,#  •  ■  ■  #x,#  (n  2:  1,  x,  €  X',  #  (  X)  is  called  a  Iim  of 
n  Items  (x  is  the  /"■  item)  A  list  is  said  to  be  in  normal  form  if  n  is  a  power  of  2  and 
each  X,  has  the  same  length  The  next  lemma  shows  how  to  convert  any  list  into  one  in 
normal  form. 

Lemma  2.  (List  Normalization)  There  is  a    2-tape  Turing  machine  M  which,  given  a 
list    X|#Xi#  ■    #x,#     of  n    items    on    its    mpui    tape,    can    construct    another    list 

>!#>;#     ■  ■  ■    #>,,#  in  O(logn)  reversals  satisfying 

(a)  2*-'  <  n  s  2*  ; 

(b)  each  \   (i  =  \ 2'' )  has  length       max      |xj,-     and 

1  =  I,         'I 

(c)  each    y,    (i     =1,   .  .  .  .n)    is    obtained    by    padding    x,     with    zeroes    and    each    _v, 
{]  =  n^\.  .  .  .  ,1' ]  is  a  string  of  zeroes. 

Proof.     First   M    computes    the    unary    representation    z^^  €    {0}'    of        max      \x.\    as 

1=1.  r      ' 

follows:  With  0(1)  reversals,  M  initializes  tape  1  to  have  all  the  odd  numbered  items 
from  the  original  list  and  tape  2  to  have  all  the  even  numbered  items.  So  tape  1  and  2 
contain  the  lists  x^^x-^#x^§  ■  ■  ■  and  x,#x^#x^#  ■  ■  ■  ,  respectively.  In  another  pass, 
M  compares  X2i-\  °°  '^P^  ^  v/hh  x^,  on  tape  2  (1  =  1,  2,  ...  .  ffi/2l),  marking  the  longer 
of  the  two  words.  In  0(1)  passes,  M  can  produce  a  new  list  7|#Z2#  ■  ■  ■  #2f„2l*  °^ 
these  marked  words.  M  repeats  this  process:  splits  the  list  into  two  with  roughly  the 
same  number  of  items,  compares  and  marks  the  longer  item  of  each  comparison, 
produces  a  new  list  consisting  of  only  the  marked  items.  After  O(logn)  reversals,  M  is 
left  with  a  list  containing  only  one  item.  This  item  has  the  longest  length  among  the  x,'s. 
It  is  now  easy  to  construct  Zq. 

The  next  goal  is  to  construct  a  string  of  the  form 

(^o#)''  (*  =  flog:"!). 

Suppose  wc  already  have  (zq#)-'.  Using  X|#X;#  •  ■  ■  #x,#,  and  (zq*)''  M  can 
compare  n  with  2':  if  a  s  2'  then  we  are  done,  otherwise  we  will  construct  (zo#)'  from 
(zo#)''  with  0(1)  reversals. 

Finally,  from  (zq^)^  ,  we  can  easily  construct  the  desired  y|#y;#  •  •  •  #>■,.#. 
Q.E.D. 

A  more  complicated  problem  is  computing  the  transitive  closure  of  a  matrix.  There 
are  known  efficient  parallel  algorithms  for  it.  Since  reversal  is  intimately  related  to 
parallel  time,  it  is  not  surprising  that  we  can  get  a  bound  on  the  reversal  complexity  of 
transitive  closure  that  is  close  to  the  corresponding  parallel  time  bound.    It  rums  out  that 


the  key  to  computing  transitive  closure  is  a  fast  matrix  transposition  algorithm: 

Lemma  3.  (Matrix  Transposition)  There  is  a  2-iape  Turing  machine  M ,  such  thai  given 
an  nXm  matrix  A   =   (a,,),  stored  in  row  major  form,  M  can  compute  the  transpose  A 
of  A,  storing  A^  in  row  major  form  m  one  of  its  tapes,  within  C>(log(n^m))  reversals. 

Proof.  We  may  assume  that  n  =  m  =  2*  for  some  integer  *  2  1  and  each  entry  of 
A  has  the  same  length.  By  lemma  2,  it  is  clear  that  M  can  uses  0(\ogn)  reversals  to 
make  A  satisfy  this  property. 

For  each  i  -  0,  1,  .  .  .  ,  i,  and  for  j  =  1,  2,  .  .  .  ,2',  let  A  '"  denote  the  /ix2*^"' 
matrix  consisting  of  the 

(0--l)2*-'-l)",  (0-l)2'--2)""' 0-2*-')'" 

columns  of  A.    For  example,  A/''  is  the /"  column  of  A  and  for  each;, 

A   =  A,<"|  AV"|    ■  ■  ■     I  A,,"' 

An  i-row  representation  of  A  is  a  string  consisting  of  the  row-major  forms  of 
A/'\  A,*'',  .  .  .  .A^y\  separated  by  'S',  in  that  order. 

Let  A"*  denote  the  /-row  representation  of  A.  The  lemma  follows  if  we  can  show 
how  to  obtain  A*'""  from  A'"  in  0(1)  reversals.  This  is  because  the  input  is  A'°'  (the  row 
major  form  of  A)  and  the  desired  output  is  A'*'  (the  column  major  form  of  A). 

In  order  to  do  the  A^'*  -A""'*  transformation,  we  need  some  auxiliary  data.  For 
this,  define  the  block  pattern: 

P''>  =  ((wo'"'-'w,'"'"#)''j)^'  (/  =  0 k-l) 


where  h-^,  =  0^  and  w.  =  V,   s   is  the  length  of  a  element  of  matrix  A.      Each 


w, 


4-1-1 


(resp.  w,=  "'"')  'marks'  a  left  (resp.  right)  half  of  the  rows  of  A/'>,  ;  =  1,  2 2'.   /"'' 

helps  us  to  'separate'  the  rows  of  each  A^^''  into  'left  half  and  'right  half. 

We  may  inductively  assume  that  each  tapes  has  two  tracks  with  the  following 
contents:  A^''  is  on  track  1  and  /'''*  is  positioned  directly  underneath  A^'*  on  track  2.  The 
case  I  =  0  is  easy  to  initialize  in  0(1)  reversals.  A  copy  of  />('*"  can  be  made  on  track  2 
of  tape  2  using  /"''.  Now  it  is  easy  to  see  how  to  obtain  A*''"  on  track  1  of  tape  2. 
Q.E.D. 

Lemma  4.   (Parallel  Copying  Lemma)  There  is  a  2-tape  Turing  machine  M   which, 

given  an  input  x  =  0"#;t,jC2       •  •    x„,  where  x,  €  2',  (/  =   1 m),  produces  string 

y  =  Xj^'x,^"    ■  •  •    x^^"     within    0{n)    tape    reversals,    where    we   assume    that  M    can 
recognize  the  boundary  between  blocks  x,  and  x^^^,  i  =  1 m-1. 


Proof.    Using  0{n)  reversals,  M  can  get  a  string  S  =  (X|X,    •  •  •    x^)-".    Notice  that 
x,-'x,-"    ■  ■  ■    jt,„-"  is  the  transpose  of  S  if  we  regard  S  as  being  a  m  x  2"  matrix.    Q.E.D. 

From  the  last  two  important  lemmas,  we  get  immediately, 

Lemma  5.  (Matrix  Multiplication)  There  is  a  2-tape  Turing  machtrie  M  such  that, 
given  two  nXn  Boolean  matrices  A  =  (a,,)  and  B  =  (b  .)  in  row  major  form.  M 
computes  their  product  AB   =   (c    )  in  O(logn)  reversals. 

Proof.    By  lemma  3,  we  can  obtain  the  transpose  B^  of  B  within  O(logn)  reversals. 
Let 

A   =    (a,,  a,   a-,,  a-.         ■  ■    a   ,  •    a     ) 

and 

B'  =   (i,,    •  ■       b„,b,,  ■    b„,    ■  ■  ■    b,„       ■  ■    b„„). 

By  lemma  4,  we  can  get 

^1  =   (''11    •  •  •    o,„r{a.,    ■  ■  ■    a.„r    ■  ■  ■    (a„,    ■  ■  ■    a„„Y 

and 

^^\    =    (*11      "        ■      ''nl^l2     ■    ■    ■     ^n2      ■    ■    ■     ''in      ■    ■    ■     ^nn)" 

within  O(logn)  reversals.    Then  0(1)  more  reversals  will  give  AB .    So  O(logn)  reversals 
are  enough  for  nxn  Boolean  matrix  multiplication.    Q.E.D. 

Lemma  6.  (Matrix  Transitive  Closure)  There  is  a  2-tape  Turing  machine  M  such  that 
given  an  nxn  Boolean  matrix  A  =  (a,.),  stored  in  row  major  form,  M  computes  the 
transitive  closure  A'  of  A  in  O(log'n)  reversals. 

Proof.     Since  A'  =  (E  +  A)" ,   where  E   is   the  identity   matrix,  we  can  reduce   this 
problem  to  logn  matrix  multiplications.  Q.E.D. 

Sorting  seems  a  very  important  tool  for  reversal  complexity.    Here  we  give  an  upper 
bound  for  sorting. 

Lemma  7.  (Sorting)  There  is  a  2-tape  Turing  machine  M  which,  given  a  list 
X|#Xt#  ■  •  •  #x^#  on  its  input  tape,  can  construct  the  sorted  list 
x^ji)#x„,2)^  •    x„(„,#   in  O(logn)  reversals.    Here  each  item  x,  is  assumed  to  have 

the  form  {k,id^)  where  t,  is  a  binary  number  representing  the  sort  key  and  J,  is  the  data 
associated  with  k,. 


Proof.  By  lemma  2,  we  can  assume  that  the  input  list  is  in  normal  form.  We  will  be 
implementing  a  version  of  the  well-known  merge  sort.  The  computation  will  be 
accomplished  in  k  phases,  where  n  =  2'.  To  initialize  the  first  phase,  we  use  the  parallel 
copying  lemma  to  construct  on  tape  1  a  siring  of  the  form 

(X|#)''s(x;#)''5  •  ■  •  *{x„*ys 

where  $  ^   2U(#},  using  0{\ogr\)  reversals. 

1-1 
For  i  =   1,  2 k,  iet/(0  =2*-  ^.2'  =  I'-l'^l.    So /(I)  =  T  and /(i)  =  2. 

At  the  beginning  of  the  (i-l)"  phase  (/  =  0,  1 k.-\),  we  will  inductively  have  on 

tape  1  a  string  of  the  form 

B.#B.#:        ■       #fi,..,# 

where  each  B.  has  the  form 

where  x,  ,,  x, -,,  .  .  .  ,x  ^,  are  distinct  items  from  the  original  list,  already  sorted  in  non- 
decreasing  order.  Call  S,  the  i'*'  block.  The  substring  in  a  block  between  two  consecutive 
'%'  is  called  a  subblock.    Observe  phase  1  has  been  properly  initialized  above. 

We  also  need  some  auxiliary  data  to  carry  out  the  induction.  Let  w^  =  0^1  where 
s  =  |x,|  (for  any  ()■  F<"r  this,  we  assume  that  at  the  beginning  of  the  i"  phase,  we  also 
store  on  tape  1  the  patterns 

/>,  =  (V#)" 

and 

Note  that  2,  can  be  easily  obtained  from  P,.,  and  C-i  within  0(1)  reversals  and  f,  can 
be  obtained  from  /',_,  in  another  0(1)  reversals.    Again,  phase  1  is  easily  initialized. 

Let  us  now  see  how  we  carry  out  phase  j  +  l.  First  we  split  the  string 
fl|#fl2#  •  ■  •  #fi,j-,#  into  two  lists  containing  the  odd  and  even  numbered  blocks:  tape 
1  now  contains  S,#fl3#  •  •  •  #B^,_,_^#  and  tape  2  contains  B^^B^il^  ■  •  •  H^B^t^.W^. 
This  can  be  done  using  0(1)  reversals. 

Our  goal  is  to  'merge'  Siy-i  ^'^h  Bjj,  for  all  y  =  1,  2,  .  .  .  ,2*"'"'  in  paralleL    Let 
«2;-.  =  (>,5>'^''"%(M)^^'*'^%    ■  •  •    %(y^fy^"'^ 
and 

B.J  =  (z,5/"-"%(z2S)^('-'>%    •  •  •    %(z2^)/('-i> 

We  begin  by  comparing  the  first  copy  of  y,  with  the  first  copy  of  z,.  Suppose  y,  a  z, 
('  s  '  here  is  the  ordering  on  the  items,  as  defined  by  the  sort  key).  Then  wc  'mark'  the 
first  copy  of  z,  and  move  head  2  to  the  first  copy  of  Z;  in  the  block  fl,;-  Wc  can  compare 
the  second  copy  of  y,  with  this  copy  of  z,,  marking  the  smaller  of  y,  and  Zj.  If  yj  is 
smaller,  we  move  to  the  first  copy  of  yj  and  next  compare  yj  with  Zj.    Otherwise,  z^  is 


smaller  and  we  next  compare  the  3'*^  copy  of  y,  with  the  first  copy  of  z,.  In  general, 
suppose  we  compare  (some  copy  of)  y  with  (some  copy  of)  z,.  We  mark  the  smaller  of 
the  two.  If  y,  is  smaller,  we  next  move  head  I  to  the  first  copy  of  y,.  ,  and  move  head  2 
to  the  next  copy  of  z,  and  proceed  with  comparing  these  copies  of  >,.  ,  and  z,;  Otherwise, 
z^  is  smaller  and  the  roles  of  y  and  z  are  exchanged  in  the  description.  (For  correctness, 
we  will  show  below  that  there  is  a  sufficient  number  of  copies  of  each  item.) 

Eventually,  one  of  the  blocks  is  exhausted.  At  that  point,  we  scan  the  rest  of  the 
other  block  and  mark  the  first  copy  of  each  remaining  item.  Notice  that  each  subblock 
now  contains  exactly  one  marked  copy.  We  then  proceed  to  the  next  pair  of  blocks 
(fl;,.i  and  «.,^0. 

After  we  have  compared  and  marked  all  the  pairs  of  blocks,  we  will  get  two  strings 
5|  and  5,,  with  one  copy  of  an  item  in  each  subblock  of  each  block  marked,  on  the  two 
tapes,  respectively.  Our  goal  is  to  produce  the  merger  of  fl-.._|  and  fl,,  from  5,  and  S->. 
Call  the  merged  result  B'  .  We  will  scan  5,  and  output  on  tape  2  a  partially  instantiated 
version  of  B'j.  Our  preceding  marking  algorithm  ensures  that  if  a  marked  copy  of  item 
w  in  ,51  is  preceded  by  h  copies  of  w  then  w  has  been  compared  to  and  found  larger  than 
h  other  items  in  5^.  In  the  merge  result,  we  want  to  place  these  h  smaller  items  before 
w.  Since  each  item  should  be  repeated  /(/  +  2)  times  in  its  subblock  in  B'  we  must 
precede  w  with 

Mi-kol/('^2)) 

blank  spaces  to  accommodate  these  h  subblocks  which  will  be  placed  there  in  another 
pass.  With  the  help  of  the  pattern  G,.;,  we  can  leave  the  required  amount  of  blank 
spaces  for  each  subblock  in  B'j.  More  precisely,  we  make  a  copy  of  2,.-,  in  a  track  of 
tape  2  and  use  it  as  a  'template'  which  has  the  block  and  subblock  structure  already 
marked  out.  As  we  scan  string  S,,  for  each  unmarked  copy  of  an  item  preceding  its 
marked  version,  we  skip  a  subblock  of  Q,^2-  ^^en  we  reach  a  marked  version  of  item  h' 
in  S|,  we  will  copy  that  /(i-^2)  successive  copies  of  w  into  a  subblock  of  2,.-.  To  see 
that  there  arc  enough  copies  of  w  on  S^  following  this  marked  tv,  observe  that  there  are  a 
total  of  /((+  1)  copies  of  iv  in  its  subblock  in  5,  and  since  at  most  2'  copies  of  w  precedes 
its  marked  version,  there  are  ^  /(i+l)-2'  a  /((-^2)  copies  left.  When  we  finish  this 
process,  we  scan  the  partially  formed  B'^  and  the  string  5,  simultaneously,  and  fill  in  the 
remaining  subblocks  of  fl'^.    We  finally  have  a  new  list  of  blocks: 

B',#fl',#    •  •  ■    *B\       # 
where  each  B'j  has  the  required  form 

This  completes  the  proof     Q.E.D. 


For  some  applications,  we  need  the  sorting  algorithm  to  be  stable: 

Corollary.     (Stable  Sorting)   The  above  sorting  algorithm  can  be  made  stable  in  this 
sense:    if  the  output  list  is 

where  each  Jt^;,,  has  the  form  (Jl:^,,,C  rf„,,,),  then  k^^,)  =   <:„(,-n  implies  ttii)  <  tt((  +  1) 
for  all  i  =   1 n  -  1. 


Proof.  To  do  this,  we  first  number,  in  another  track  (say  track  K)  of  the  tape,  the 
items  in  the  string  to  be  sorted  sequentially.  This  takes  C>(logn)  reversals.  Next  use  the 
method  of  lemma  7  to  sort  the  string.  After  sorting,  those  items  with  a  common  sort  key 
will  form  a  contiguous  block.  We  apply  the  sorting  method  again  to  each  block  of  items, 
where  we  now  sort  each  block  by  the  numbers  of  items  (as  written  on  track  K).  To  do 
this  in  O(logn)  reversals,  we  must  do  this  second  sortmg  for  all  the  blocks  simultaneously 
—  our  above  method  can  be  modified  for  this.    Q.E.D. 


3.    Simulating  Deterministic  Space  By  Reversal 

In  this  section,  we  will  prove  one  of  our  main  results:  for  s(n)  =  n(logn), 
DSPACEis)  C  DREVERSAL.is).  . 
Thus,  for  deterministic  Turing  machines,  reversal  as  a  complexity  resource  is  at  least  as 
powerful  as  space.    This  is  a  direct  consequence  of  the  following  theorem. 

Theorem  8.  Suppose  s{n)  is  reversal  constructible  and  s{n)  =  fl(logn).  Then  any 
language  L  €  DSPACE(s)  can  be  accepted  by  a  2-tape  deterministic  Turing  machine  M 
within  0{s)  reversals. 

Proof.    First  we  assume  that  s(n)  =  fl(n). 

Suppose  a  deterministic  Turing  machine  N  accepts  L  in  space  s{n).    We  construct  a 
2-tape  Turing  machine  M  that  accepts  L  in  0{s{n))  reversals. 

(a)     Given  an  input  x  of  length  n,  M  first  generates  a  string  of  the  form 

Wi  =  0#r#2#    ■  •  •    #2'^"'-l# 
Since  s{n)  is  reversal  constructible,  lemma  1  implies  that  this  can  be  done  in  0{s{n)) 
reversals.    Notice  that  we  can  suppose  that  all  the  possible  configurations  of  A^  on 
input X  are  included  in  the  string  W^. 
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(b)     From  string  W,,  within  0(1)  reversals,  we  construct  another  string 


W;      =      Zo#^i# 


#^;.<.,^,# 


such  that   \z„\  =  s{n)  and  z,^  is  the   'successor  configuration'  of  configuration  in   in 

the  string  W',,  m   =  0,  1 2'""-  1,  as  follows:    First  we  scan  the  string  W,  from 

left  to  right.  When  we  have  completed  scanning  a  configuration  m  in  the  string  W,, 
we  know  what  the  next  action'  of  this  configuration  of  iV  is,  and  hence  we  can  put 
this  information  at  the  right  end  of  the  corresponding  configuration  m  m  the  string 
W|  (for  this,  we  only  need  to  make  the  alphabet  of  M  large  enough).  Then  we  go 
back  through  the  string  W,  from  right  to  left,  using  the  information  recorded  in  the 
last  scan,  to  obtain  W,. 

Combining  strings  W,  and  HS  into  one  tape  ,  we  get  a  string 


W,  =  u,#u-,# 


#", 


with  each  u,^  (m   =    1,2, 


,  2')  is  a  2-track  string  of  the  form 


C„ 


CJ 


(c) 


where  C„.  and  C „'  are  configurations  of  A'  on  input  .r  such  that  C„  —  C„,'. 

Next    we    construct    two    identical    strings    5[    and    S-,    on    tapes    7",    and    7-    of   M, 
respectively,  using  0{s)  reversals: 

5,  =  5,  =  ((u,J)''#(«.5)2'#    •  •  •    #(u„J)2'%)-' 
Wc    will    call    each    substring    of    the    form    (u,J)-'#(u;5)''#       •       #(u,,X)-'%    a 

segment  of  5,  or  5,  and  for  each  j  =  1,2 2'  call  the  substring  (u.S)^'  a  u  - 

subsegment. 

(d)     With  these  rwo  strings,  we  now  find  the  computation  path 


P  =  C, 


C, 


of  N  on  input  x  in  another  0(1)  reversals  as  follows. 
(dl)  Consider  the  first  u,  -subsegment  in  5,  where  the  upper  track  of  u,  contains  the 
initial  configuration  C,  of  A'  on  input  x.  Begin  by  marking  the  first  copy  of  u,  in 
this  subsegment  and  place  the  head  H^  of  T,  between  the  first  and  second  copies  of 
u,  in  the  u,  -subsegment.  (By  'marking'  of  a  copy  of  u,  ,  wc  mean  a  special  symbol 
is  placed  at  the  end  of  that  copy  on  a  separate  track.)  Moreover,  place  the  head  W, 
of  7-,  at  the  beginning  of  the  string  Sj; 

(d2)  Inductively,  suppose  we  have  already  found  and  marked  a  sequence  u,  ,  u,  ,  .  .  .  ,m, 
on  the  string  5,  on  tape  7,  such  that  the  lower  track  of  u,  is  identical  to  the  upper 
track    of    tt,     ,   y  =  1,  2.  .  .  .  ,^- 1.     Hence    the    upper    track    of    u,    contains    the 
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configuration   C,   in   the   path  P,   /  =   1,  2 q.     Suppose   also  that  head  W,    is 

placed  between  the  first  and  second  copies  of  u,    in  the  q'''  segment  in  5,  and  that 

i 

head  H-,  is  placed  at  beginning  of  the  q'''   segment  in  5;.     Our  goal  is  to  find  a 

subsegment  {u      $)-'  in  the  q'^   segment  of  S-,  such  that  the  lower  track  of  u,    is 

«-i  "  » 

identical  to  the   upper  track  of   w,  To  do  this,   we  compare   the   lower  track  of 

«- ' 

successive  copies  of  u,  in  the  subsegment  (u,  $)■  of  the  q'''  segment  of  5,  with  the 
upper  track  of  the  first  copy  of  the  configuration  in  each  subsegment  of  the  q''' 
segment  of  S,. 

Eventually,  we  will  find  the  desired  subsegment  («,  $)-'  in  the  q'''  segment  of  the 
string  5-,.    The  head  H^  will  now  be  between  the  first  and  second  copies  of  u,       in 

subsegment  (u,     J)-'.    We  will  use  the  2'-  1  unscanned  copies  of  this  subsegment  in 

1- ' 

S^  to  find  the  same  subsegment  («,     $)-    in  the  (^-1)"  segment  in  the  string  5,. 

<?■  ■ 

When  we  find  that  u,     -subsegment  in  5,,  the  head  //,  will  be  between  the  first  and 

«-' 

second  copies  of  u,       and  we  move  the  head  H-,  to  the  beginning  of  the  {q+iy 

segment  in  string  ^2.    Now  we  are  ready  for  next  iteration. 

The  fact  that  there  are  2'  copies  of  Uj  in  each  M^-subsegment,  (J  =   1,  2 2^), 

and  there  are  2'  segment  in  each  of  the  strings  5,  and  Sj  ensures  that  we  always  have 
enough  copies  of  each  u    (J  =  1,2,...  .2^),  for  the  comparisons  in  our  algorithm. 

If  N  accepts  or  rejects  x,  then  at  some  moment  in  the  above  iteration,  M  will  realize 
it.  M  will  accepts  or  rejects  accordingly.  If  iV  is  in  an  'infinite  loop'  then  M  will  exhaust 
one  of  the  two  strings  5,  and  5,  before  it  finds  a  terminal  configuration  of  N  on  input  x: 
M  will  also  reject  x  in  this  case. 

Finally  we  indicate  how  to  modify  the  above  proof  for  the  case  s{n)  =  fl(logn). 
Instead  of  storing  the  entire  input  string  x  with  each  configuration,  we  attach  to  each 
modified  configuration  of  N  only  the  position  of  the  input  head.  Then  by  sorting  the 
string  W,  by  the  positions  of  the  input  head,  followed  by  one  sweep  of  the  input  x,  we 
are  able  to  record  the  'current  input  symbol'  into  the  each  of  modified  configurations.  A 
similar  sort  can  be  applied  to  M^,  '°  '^^  construction. 

This  completes  the  proof.    Q.E.D. 

We  should  remark  that  a  result  of  Rytter  and  Chrobak  [10]  may  appear  to  be 
stronger  than  theorem  8  for  the  case  s(n)  =  logn.  Their  result  says  that  0(1)  reversals 
are  sufficient  to  simulate  O(logn)  space.  However,  a  closer  examination  reveals  that 
they  do  not  count  reversals  on  the  input  tape  -  in  fact  their  simulation  makes  a 
polynomial  number  of  reversals  on  the  input  tape. 
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4.    The  Relationship  Between  Space  and  Reversal 

Borodin  [3|  proved  that  a  s{n)  space-bounded  computation  of  nondeterministic 
multitape  Turing  machines  can  be  simulated  by  uniform  circuits  of  depth  0{s-{n)). 
Later,  Pippenger  [9|  showed  that  uniform  circuits  of  depth  d{n)  can  be  simulated  by 
0(d^{n))  reversal-bounded  deterministic  multitape  Turing  machines.  However,  Borodin 
and  Pippenger  were  using  two  different  kinds  of  uniform  circuits:  Borodin's  circuits  are 
"space  uniform'  while  Pippenger's  circuits  are  'reversal  uniform'.  A  closer  investigation 
reveals  that  Borodin's  circuits  are  in  fact  'reversal  uniform'  too.  1  hus  combining  their 
results  with  a  slight  modification,  we  obtain: 

NSPACE{s)  C  DREVERSAL(s^)  . 

This  yields  the  strongest  result  for  the  relationship  between  space  and  reversal  complexity 
until  now. 

Using  theorem  8,  we  can  greatly  improve  this  result: 

Theorem  9.  Suppose  s(n)  is  reversal  constructible  and  s{r\)  =  n(logn)  Then  any 
language  L  €  NSPACE(s)  can  be  accepted  by  a  2-tape  deterministic  Turing  machine  A/ 
within  O(s-)  reversals. 

Proof.    By  theorem  8  and  Savitch's  result  [11]: 

NSPACEis(n))  C  DSPACE(s-in)) 
we  get  our  desired  theorem.    Notice  that  the  reversal  constructibility  of  the  function  s{n) 
implies  the  reversal  constructibility  of  the  function  s'{n).    Q.E.D. 

REMARK:  In  fact,  we  can  give  a  direct  proof  for  this  theorem,  using  the  a  method 
different  than  that  used  in  theorem  8.  For  this,  we  use  lemma  6  to  compute  the  transitive 
closure  of  the  transition  matrix  of  N  on  input  x. 

We  can  get  an  even  stronger  result.    Borodin,  Cook  and  Pippenger  [2]  have  shown 
the  following  result: 

R5PACE(s)  C  DSPACE(s') 
where  RSPACE{s)  denotes  the  class  of  languages   which   are  accepted  by  probabilistic 
Turing  machines  within  s{n)  space.    With  our  theorem  8,  we  get: 


Theorem  10.  Suppose  s{n)  is  reversal  constructible  and  s(n)  =  n(logn).  Then  any 
language  L  (.  RSPACE(s)  can  be  accepted  by  a  2-tape  deterministic  Turing  machine  M 
within  0{s-)  reversals. 

The  restriction  of  constructibility  of  the  function  s(n)  is  not  severe      We  observe 
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that  most  common  complexity  functions  are  reversal  constructible: 

Lemma  11.  For  all  integers  t  >  1,  n'  flog*n,  k" .  n" ,  respectively)  is  logn  flogn,  n, 
n,  respectively )-reversal  constructible . 

The  proof  of  this  lemma  is  omitted. 

If  we  do  not  want  to  restrict  the  function  s{n)  to  be  constructible,  then  we  have 
following  theorem: 

Theorem  12.    Let  s{n)  be  an  arbitrary  complexity  Junction  with  s(n)   =   n(logn).    Then 

DSPACE{s)  C  DREVERSAL{5-). 
and 

NSPACE(s)  C  DREVERSAL(s^) 

Proof.  The  proofs  for  these  two  results  are  similar,  so  we  only  outline  the  proof  for 
the  first  one. 

We  use  the  well-known  technique  of  guessing  the  value  of  s{n),  starting  from 
s{n)  =  logn,  logrt^l,  ....  For  each  guessed  value  h,  we  use  0(h)  reversals  doing  the 
computation  in  the  proof  of  theorem  8.  If  the  input  is  in  the  language,  we  will  accept 
when  h  =  s(n)  using  at  most 

2    0{h)   =  0{s{n)-) 

^  =  lo  g  n 

reversals.    Q.E.D. 

To  obtain  results  in  which  we  simulate  reversal-bounded  Turing  machines  by  space- 
bounded  Turing  machines  efficiently,  we  first  note  following  lemma.  (See  Yap  [13]  for  a 
proof.) 

Lemma  13.  //  a  language  L  is  accepted  by  a  deterministic  Turing  machine  M  within 
reversal  r{n)  =  Cl(logn),  then  M  simultaneously  accepts  within  time  0(2'^)  and  space 
OiT). 


Theorem  14.    For  r  =  il{logn), 

DREVERSAL{r)  C  DSPACE{r~) 

Proof.    Simon  [12,13]  has  shown  that  D-TIME-REVERSAL{t,  r)  C,  DSPACE{r[ogt). 
By  lemma  13,  we  get  the  conclusion  immediately.    Q.E.D. 


14 


We  close  this  section  by  following  important  corollary: 

Corollary. 

DSPACE{n'^^^>)  =  NSPACE{n°'^')   =  DREVERSAL(n°'^^) 

DSPACE(\og'^''^'n)  =  NSPACE{\og°'^^n)  =  DREVERSAL{\og°'^^n) 


5.    Tape  Reduction  Theorem  for  Reversal  Complexity 

Now  we  are  ready  for. the  tape  reduction  result  for  reversal  complexity.  In  fact,  if 
we  restrict  the  complexity  functions  to  be  reversal  constructible,  then  from  theorems  14 
and  8,  we  get  immediately: 

If  r(n)  =   Cl{\ogn}  and  r-(n)  is  reversal  constructible .  then 
DREVERSAL{r)  C  DREVERSAL.Jr-) 

However,  here  we  would  like  to  remove  the  restriction  of  constructibility  of  the 
complexity  functions  and  give  a  direct  proof  for  general  complexity  functions.  For  this 
we  introduce  the  concept  of  'phase'. 

To  define  phase  we  first  give  a  precise  definition  of  reversal  complexity.  Let 
^  ~  (CJiso  ^^  ^  computation  path  of  a  fc-tape  Turing  machine.  We  say  that  head  h 
(h  =  0,  .  .  .  .k)  tends  in  direction  d  (for  d  i  {-1,  +1})  in  C,  if  the  last  transition 
C^_i  I —  C^  (J  s  (•)  preceding  C,  in  which  head  h  moves  is  in  the  direction  d.  If  head  h 
has  been  stationary  up  till  C,  we  say  head  h  tends  in  the  direction  ^  =  0  in  C,.  We  also 
say  that  head  h  has  tendency  d  if  it  tends  in  direction  d.  It  is  important  to  note  that  the 
head  tendencies  of  a  configuration  C  are  relative  to  the  computation  path  in  which  C  is 
embedded.  We  say  head  h  makes  a  reversal  in  the  transition  C^.,  \ —  C^  in  C  if  the 
tendency  of  head  h  in  C,,,  is  opposite  to  its  tendency  in  C ,.  We  say  C^_,  ^ —  C.  is  a 
reversal  (transition).  The  reversal  of  C  is  the  total  number  of  reversals  made  by  all  the 
heads  over  the  entire  computation  path. 

Observe  that  changing  the  tendency  from  0  to  ±  1  is  not  normally  regarded  as  a 
reversal.  However,  for  the  purposes  of  some  proofs,  we  may  regard  C  ,  ' —  C  to  be  a 
reversal  if  the  tendencies  of  some  head  h  in  C,.. ,  and  in  C^  are  different:  this  only 
overcounts  the  number  of  reversals  by  it  +  1,  once  for  each  of  the  heads  /i  =  0,  ...  .it. 
Each  computation  path  C  can  be  uniquely  divided  into  a  sequence  of  disjoint  partial 
computation  paths  Pq,  P,,  .  .  .  ,  where  every  configuration  in  P,  (/  =  0,  1,  .  .  ,)  has  the 
same  head  tendencies.    Each  P,  is  called  a  phase.    If  C,  i —  C,.,  I —     •  •  •     I —  C,.,  is  a 
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phase,  then  C,  and  C,.,  will  be  called  the  start  configuration  and  end  configuration  of  the 
phase  P^.  The  transition  from  one  phase  P,  to  the  next  P,_|  is  caused  by  the  reversal  of 
at  least  one  head.  Clearly  the  reversal  of  C  bounded  by  A:  ^  1  times  the  the  number  of 
phases  in  C . 

Lemma  15.  //  a  language  L  is  accepted  by  a  Turing  machine  N  making  at  most  r{n) 
reversals  on  input  of  length  n,  then  there  is  a  Turing  machine  M  which  satisfies 
following  conditions: 

(1)  M  and  N  have  the  same  number  of  tapes; 

(2)  M  accepts  L  in  at  most  r(n)  reversals; 

(3)  m  each  step  of  M  ,  at  least  one  tape  head  moves;    and 

(4)  ihe  head  of  each  work  tape  of  M  does  not  change  the  symbol  it  is  scanning  unless 
It  is  about  to  leave  the  square. 

Proof.  Given  a  Turing  machine  jV  accepting  L,  by  increasing  the  'finite  state 
complexity'  of  N ,  we  can  obtain  the  Turing  machine  M  satisfying  the  conditions  stated  in 
the  lemma.    Q.E.D. 

Theorem  16.  (Tape  Reduction  for  Reversals)  Let  r{n)  be  any  complexity  function 
where  r(n)  =  n(logn).  //  L  is  accepted  by  a  multitape  Turing  machine  within  r{n) 
reversals,  then  it  is  accepted  by  a  2-tape  Turing  machine  M  that  makes  at  most  0{r^) 
reversals,  i.e. 

DREVERSAL{r)  C  DREVERSAL-,{r-) 

Proof.    We  will  use  a  direct  simulation. 

Suppose  L  is  accepted  by  a  *-tape  Turing  machine  N  within  r{n)  reversals  on  input 
of  length  n.  Without  loss  of  generality,  we  assume  that  N  satisfies  the  properties  stated 
in  lemma  15. 

We  will  construct  a  2-tape  Turing  machine  M  which  will  simulate  N  and  make  at 
most  0{r^{n))  reversals. 

M  uses  k+\  tracks  on  one  of  its  tapes,  say  7",,  to  hold  the  configurations  of  N, 
namely,  track  0  corresponds  to  the  input  tape  of  A'  and  track  j  (1  <  y  <  k)  corresponds 
to  the  tape  j  of  N.  Sometime  we  also  talk  about  'the  head  position'  or  'the  head 
tendency'  of  track  j ,  referring  to  the  corresponding  simulated  head  of  A'. 

For  the  start  configuration  of  phase  /  in  the  computation  of  N ,  M  can,  within  0(1) 
reversals,  make  the  corresponding  configuration  of  7"]  satisfy  following  properties: 

(1)      all  heads  of  the  tracks  of  7,  will  tend  to  the  right  in  that  phase; 
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(2)      all  heads  of  the  tracks  of  7",  are  at  the  square  0. 

So  the  start  configuration  of  each  phase  of  A'  is  represented  on  the  tracks  of  7",  as  follows: 

track  0 
track  1 


X   X   X   X   X 

t-x  XXXXXXXXXXXX 

XXX 

t-Jt   X    X    X   X   X 

X   X  X   X 

r-X   XXX    X   XXX   XXX 

track  * 

where  the  '''  stands  for  the  position  of  the  head  of  the  track  and  '  -  '  indicates  the 
tendency  of  the  tape  head. 

By  a  full  irace  of  phase  /  of  A\  we  mean  a  tuple  of  the  form 

T  =    <  <? ,  /ij^,  Qq h^.  a^  > 

where  q  is  a  state  of  A',  a.'s  are  tape  symbols  of  N  and  the  /j  's  lie  in  the  range 
0  s  /i  <  c',  where  c  is  a  constant  depending  on  N.  Intuitively,  h  denotes  a  head 
position  on  tape  j  of  N  and  a  is  the  symbol  under  the  head.  The  proof  of  lemma  13 
shows  that,  within  the  /'''  phase,  no  head  position  /i,  can  be  out  of  the  range  0  s  /i,  <  c' . 
We  will  store  each  full  trace  in  2k ^3  tracks  of  tape  7-,  of  W ,  with  the  first  k-^l  tracks  for 
the  hj's,  the  k~  V^  track  for  the  <?,  and  the  last  k  -  1  tracks  for  the  a.'s. 

A  partial  trace  of  phase  /  of  /V  is  obtained  by  replacing  any  of  the  components  of  a 
full  trace  by  a  distinguished  symbol  '*'. 

Let  Kq   be  the  set  of  all  partial  traces  of  the  ;'''  phase  of  N.    We  define  a  quasi- 
ordering  on  Cq'  as  follows:    if 

T    =     <     -,hQ,    - h,,    -     > 

where  '-  '  means  that  we  do  not  care  what  is  in  that  position,  and  if 

t'  =  <   -  ,  /I'o.  - 'I't.  -   > 

then  T  :<  t'  iff  /i,  s  h' ,  for  I  =  0,  .  .  .  ,k.  U  t  <  t'  and  t'  <  t  then  we  say  they  are 
equivalent,  and  write  t  -  t'.  (This  is  a  quasi-ordering  rather  than  a  partial  ordering 
because  t  ^  t'  does  not  mean  t  =  t'.) 

M  will  simulate  phase  i  of  N  using  the  following  steps  (details  for  accomplishing 
these  steps  follow  afterwards). 

(Step  1).  M  generates  on  the  tape  7",  all  partial  traces  of  the  i'"'  phase  of  \'  which  has  the 
form: 

<  <7.  '•o-  • ''*.  •  > 

i.e.  all  the  tape  symbols  are  replaced  by  '»'.    Let  W,"^  denote  this  list  of  partial 

traces. 

(Step  2).  Next,  M  converts  each  partial  trace  in  W,'''  into  a  full  trace  by  filling  in  the 
correct  tape  symbol  for  each  head  position  on  each  track  of  the  trace.  More 
precisely,  if  h,  is  the  head  position  of  N  on  track  /,  and  j,  is  the  symbol  in  the 
h/''  square  on  track  /  of  tape  7,,  then  we  place  a,  into  the  (A-^Z)*/'"  track  in 
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the  tape  T.  under  the  /i,.    Let  Wj"  denote  this  list  of  full  traces. 

(Step  3)  M  does  a  'topological  sort'  of  the  list  WV"  of  full  traces  found  on  tape  T..  By 
this  we  mean  that  if  t  and  t'  are  full  traces,  t  *  t'  and  x  and  t'  are  not 
equivalent,  then  t  appears  before  x'. 

(Step  4).  For  each  full  trace  x  in  the  list  W,"',  construct  its  successor  partial  trace  x'. 
We  place  x'  on  another  2k^3  tracks  directly  below  x  on  the  tape  7",.  By  the 
successor  partial  trace  x'  of  full  trace  x,  we  mean  that  if  x  corresponds  to  a 
configuration  C  of  A'  and  C  \ —  C  in  phase  i,  then  x'  corresponds  to  the 
configuration  C  . 

(Step  5).  Similar  to  the  proof  of  theorem  8,  using  the  string  on  tape  T,  obtained  in  step 
4,  we  can  place  marks  on  the  upper  2k  ^3  tracks  of  the  tape  T,  to  determine  a 
sequence  P^  of  full  traces 

Xq  h—  Xi  •   ■   ■      I —  X, 

corresponding  to  the  partial  computation  path  in  phase  i  of  the  computation  of 
A',  where  Xg  and  x,,  respectively,  are  the  full  traces  corresponding  to  the  start 
and  end  configurations  of  phase  /. 

(Step  6).     By  the  definition  of  the  quasi-ordering,  we  have 

To    ^    T,    ^       •    •    •       <    T, 

and  they  appear  in  this  order  in  the  tape  Tj,  so  we  can  read  the  P,  sequentially 
on  the  tape  7-,  and  modify  the  contents  of  each  track  of  tape  Tj  accordingly. 

(Step  7).     Prepare  7,  for  next  phase  of  A^. 

M  will  repeat  the  above  algorithm  until  it  finds  that  N  accepts  or  rejects,  then  M 
accepts  or  rejects  accordingly. 

Clearly,  \t  accepts  iff  A'  accepts. 

Now  we  present  the  details  to  make  sure  that  M  makes  at  most  O(r^)  reversals. 

STEP  1  : 

By  lemma  13  we  can  assume  that  hj  is  a  c-ary  number  so  its  length  \hj\  is    ^  i,  for 
all  j  =  0,  1,  .  .  .  ,  t.    Similar  to  lemma  1,  we  can  generate  string 

S,  =  0,#1,#    •  ■  •    #(c'-l), 
within   0{i)   reversals,   where   m^  is  the  c-ary  representation  of  integer  m   of  length  i, 
prefixed  with  O's  if  necessary. 

From  5,,  we  can  construct  k  +  l  strings  defined  as  follows: 

S  (0)  =  S'" 
and 

^/"-'^  =  ((o,#)^'*(i,#)^"  •  •  •  ((c'-i),#rY*"" 

>i  =  1 k.     By    lemma    4,    each    of    these    5/''*"'s    can    be    constructed    within 
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0(log(c'*))  =  0(0  reversals. 

If  we  put  string  5/''" ''  in  the  track  ^  *  1  of  T,,  h  =  0,  1 k,  then  we  will  get  a 

string 

where  aU  K ^  (J  =   1 c'***")  are  different,  and  each  AT,  is  of  form 

<  (/>o)c.  (^). (''Jc  > 

0  s  y,    <  £•',  (with  (h  ),  on  track  /),  /  =   0,  1 *. 

Suppose  A/  has  e  states.    From  /,  we  construct 

We  construct  VV|"'  on  tape  T-,  by  putting  a  different  state  of  A'  to  the  k.~  1"  track  of  each 
Kj  in  K.'  in  /,,  y   -    1.   .  .   .  ,c"*'". 

W|'''  can  now  be  represented  by  following  string: 
where  each  t    is  of  form 

with  (/i,)^  in  the  Z'*"  track,  /  =  0,  1,  .  .  .  .<:  and  q  in  the  k~  1"  track  of  t^. 

STEP  2  : 

It  is  enough  to  show  how  we  deal  with  one  track,  say  track  j,  of  T^.  First,  we  sort 
the  partial  traces  in  the  list  tV,"'  on  tape  T";  by  the  head  positions  on  track  j .  This  can  be 
done  within  0(()  reversals.  Then  using  one  more  reversal,  we  can  scan  tapes  T,  and  7,, 
reading  the  corresponding  symbol  on  the  h'^  square  in  the  track  i  in  tape  Tp  putting  the 
symbol  into  the  track  (<:*2)  +  y  under  the  corresponding  head  position  {h,)^  in  tape  7,, 
/i^  =  0,  1 c'-l. 

STEP  3  : 

To  topologically  sort  the  full  traces  in  the  list  W,''^  on  tape  T^,  we  only  have  to  do  a 
stable  sort  of  the  full  traces  by  the  head  position  on  track  0,  then  by  the  head  position  on 
track  1,  .....  finally  by  the  head  position  on  track  k.  This  can  be  done  in  0{\^  reversals. 
We  claim  that  if  t  :<  t'  and  t  and  t'  are  not  equivalent,  then  t  appears  before  t'  in  the 
list  W,<''.  In  fact,  if  t'  appears  before  t  in  the  list  W,*'\  then  at  least  on  one  of  the 
tracks,  the  head  position  of  t  is  'larger'  than  the  head  position  of  t'.  Thus  t  <  t'  can  not 
be  true. 

STEP  4  : 

For  each  of  the  full  traces  in  the  list  tV2*'\  we  can  decide  immediately  what  the 
'next'  action  is.    That  is,  given  a  full  trace 

we  can  decide  the  successor  partial  trace 
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t'  =  <  q' ,  /I'o,  *,  .  .  .  .h\,  *  > 
Thus  by  scanning  W^''\  within  0(1)  reversals,  we  can  construct  a  list  W,''^  such  that  the 
/'*  partial  trace  of  W3'"  is  the  successor  partial  trace  of  the  Z'*"  full  trace  of  W,'"  in  phase  1 
for  all  /.  (Remark:  suppose  t  i —  t'  in  phase  i,  where  both  t  and  t'  are  full  traces,  then 
the  head  positions  of  t'  will  be  obtained  by  adding  the  corresponding  head  positions  of  t 
by  1  or  0.  This  can  be  done  without  making  any  reversals  when  we  are  scanning  W,<". 
If  a  full  trace  does  not  have  successor  partial  trace  in  phase  i,  for  example,  it  corresponds 
to  the  end  configuration  of  phase  i.  we  just  designate  a  special  symbol  to  denote  its 
symbolic  successor.) 

So  step  4  can  be  done  in  0(i)  reversals. 

STEP  5  : 

Notice  that  unlike  the  proof  of  theorem  8,  the  current  symbol  of  each  trace  is 
uniquely  determined  by  the  head  position  so  there  is  no  need  to  convert  the  partial  traces 
of  W3*''  to  full  traces.  Hence  we  can  compare  the  full  traces  in  the  list  W,"'  with  the 
partial  traces  in  the  list  Wj'"  directly. 

STEP  6  : 

Now  collect  into  a  contiguous  list  W^'"  of  all  marked  full  traces  in  the  list  W2''\  in 
the  order  they  appear  m  the  list  WV".  Note  that  this  is  exactly  the  order  in  which  the 
corresponding  configurations  of  iV  appear  in  the  partial  computation  path  in  phase  /'. 
Now  it  should  be  clear  how  we  modify  the  tape  contents  of  7,  to  obtain  the  contents 
which  represents  the  start  configuration  of  next  phase  of  A^:  For  each  track  h,  by 
scanning  the  list  W^^'\  we  would  know  how  to  update  the  symbols  of  that  track  from  left 
to  right.    Note  that  0(1)  reversals  suffice  for  all  the  tracks. 

STEP  7  : 

We  reverse  the  contents  on  those  tracks  of  tape  T^  that  corresponds  to  tape  heads  of 
N  making  a  reversal  as  we  go  from  phase  1  to  phase  j  +  l. 

Since  N  has  s  r(n)  phases  and  A/  needs  at  most  0{i)  reversals  to  simulate  the  i''' 
phase  of  N,  M  makes  0(r-(n))  reversals.    This  completes  our  proof.    Q.E.D. 


6.    Reversal  Hierarchies 

Now  that  we  have  a  tape  reduction  theorem  for  reversal  complexity,  we  can  obtain 
reversal    analogues   of   well-known    theorems   on   space    and    time    complexity.     In   this 
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section,  we  prove  a  hierarchy  result  for  reversal  classes. 

Theorem  17.  (Reversal  Complexity  Hierarchy)  Let  r ^{n)  and  r-.{n)  be  complexity 
functions  such  that  {r-J^n))'  =  o{r^{n)),  r-^in)  =  fi(log/i)  and  r^{n)  is  reversal- 
constructible,  then 

DREVERSAL{r^(n))  -  DREVERSAL(r,(n))   ?t   0 

Proof.     Since   (r-,(n))'  =  o{r^{n)),   by  deterministic  space   hierarchy  theorem,   there 
exists  a  language  L  such  that 

L  6  DSPACE{r^{n)) 


and 

By  theorem  8, 
and  by  theorem  14, 
This  gives 


L  i   DSPACE({r.{n))-). 

L  e  DREVERSAL(r^{n)) 

L   ^   DREVERSAL{r,{n)). 

L  i  DREVERSAL{r^{n))  -   DREVERSAL{r.(n)). 


Q.E.D. 

Remark:    We  can  prove  this  theorem  directly  with  the  help  of  the  tape  reduction  theorem 
-  but  an  appeal  to  the  space  hierarchy  theorem  as  above  gives  a  shorter  proof. 


7.    Complete  Languages  for  Reversal  Complexity 
First  we  prove: 


Lemma  18.     Let  M    be   a   Turing   machine   that  accepts   in  f(n)    reversals.     If  f{n)    is 
reversal  constructible  then  there  exists  a  Turing  machine  N  such  that 

(1)  N  and  M  have  the  same  number  of  tapes; 

(2)  L{M)  =   L{N); 

(3)  N  makes  at  most  0(J{n))  reversals;  and 

(4)  N  always  halts. 

Proof.  Since /(n)  is  reversal  constructible,  iV  first  computes  the  unary  form  of/(n), 
appending  it  to  the  end  of  one  of  its  tapes.  N  will  use  it  as  a  counter  to  protect  itself 
from  making  too  many  reversals.  Then  N  simulates  M  step  by  step,  and  in  its  finite 
control  N  guards  the  'infinite  loop'  of  M .    In  this  description,  by  'infinite  loop'  we  mean 
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that  M  is  in  a  non-halting  computation  path  in  which  no  reversal  is  made.  Once  iV  finds 
out  that  M  runs  in  a  wrong  mode,  either  using  too  many  reversals  or  in  'infinite  loop',  N 
rejects  immediately.  It  is  easy  to  see  that  /V  satisfies  the  properties  stated  in  the  theorem. 
Q.E.D. 

Corollary.    If  f{n)  is  reversal  constructible ,  ihen  for  any  /i  2:  2 

co-DREVERSALjJ{n))   =  DREVERSAL^(J{n)) 

As  noted  in  Yap  [13],  hierarchy  theorems  and  the  existence  of  complete  languages 
for  a  complexity  class  can  be  proved  in  a  systematic  fashion  using  the  concept  of  'efficient 
universal  machines',  defined  as  follows: 

Definition.  A  Turing  machine  U  is  an  efficient  universal  machine  for  a  class 
DREVERSAL{F),  where  f  is  a  family  of  complexity  functions,  if 

(1)  U  takes  input  of  the  fcm  ipx  on  its  input  tape  where  (,  x  £    {0,  1)'; 

(2)  Let  LXU)  =    {x  \U  accepts  i#x],  then 

(2.1)  for  all  I  >  0,  there  exists  an  /  €  F  such  that  U  accepts  (#j:  within  0,(/(|x|)) 
reversals,  where  the  subscript  i  means  that  the  implicit  constant  in  the  0-notation 
depends  on  i; 

(2.2)  for  any  L  €  DREVERSAL{F),  where  L  C  {0,  1}',  there  are  infinitely  many 
indices  /  such  that  L  =  LXU)- 

For  a  general  universal  machine  U  (not  necessarily  efficient)  taking  input  of  the 
form  i#x,  if  we  fix  the  prefix  i#  of  the  input,  we  can  conveniently  regard  U  as  an 
ordinary  Turing  machine  t/,  which  accepts  the  language  L^U)  =  {x  \  U  accepts  (#jt}. 
Sometimes  we  write    {Uq,  U^,  .  .  .}  instead  of  U. 

We  have  following  theorem. 

Theorem  19.    The  classes  DREVERSAL{n°^^'').  and  DREVERSAL{0{\y)  have  efficient 
universal  machines. 

Proof.    We  just  show  the  result  for  DREVERSAL{n°^^^). 

Let  U  =  {Uq,  i/,,  .  .  .}  be  the  universal  Turing  machine  for  2-tape  deterministic 
Turing  machines.    We  construct   a  new  3-tape  Turing  machine  U  as  follows: 

For  a  given  input  i#x,  U  simulates  U,  on  input  x  step  by  step  for  at  most  |x|' 
reversals.  If  t/,  docs  not  accept  x  within  |j:|'  reversals,  U  rejects,  otherwise  U  accepts. 
To  simulate  U„  U  first  makes  0(2'^')  copies  of  the  'machine  code'  i  for  {/,  and  puts  them 
on  its  tape,  say,  T^.    This  can  be  done  within  0,([x|')  reversals  by  lemmas  4  and  11  . 
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Then 

U  can  simulate  each  step  of  t/,  by  first  reading  a  copy  of  the  'machine  code'  i  of  L',  on 
tape  T^  without  makmg  extra  reversals.  0(2  ')  copies  of  machine  code  are  enough  for 
the  whole  simulation  by  lemma  13  . 

It  is  easy  to  check  that  U  is  an  efficient  universal  machine  for  DREVERSALin"^^'). 
By  lemma  18  ,  wc  can  suppose  that  U  on  any  input  i#x  always  halts  and  makes  0,(|x|') 
reversals.  If  a  language  L  is  in  DREVERSAL{n'')  then  our  theorem  16  shows  that 
L  €  DREVERSAL.{n~''),  so  t7  can  recognize  L  efficiently.    Q.E.D. 

Theorem  20.    The  classes  DREVERSALin'^"')  and  DREVERSAL{0(\y )  have  complete 
language  under  Xag-space  many-one  reducibility. 

Proof.    Again,  let  us  consider  the  case  of  DREVERSALin'^"^). 

From  theorem  19,  we  may  suppose  that  U  =  {U^,  U^,  .  .  . }  is  the  efficient  universal 
Turing  machine  for  DREVERSAL(n'-"^^),  where  all  t/  always  halts  on  any  input,  and  U 
accepts  or  rejects  in  /,(i)  reversals,  /,(n)  is  a  polynomial. 

It  is  easy  to  check  that  following  language  is  DREVERSAL{n'-^'^')  complete: 
L  =    {w  =  (#x#0'"  :    m   =  /,(kl),  ^  €   U,\ 
Q.E.D. 


8.    Remarks 

In  this  paper,  we  proved  various  relationships  between  space  complexity  and 
reversal  complexity,  and  also  gave  the  important  tape  reduction  theorem  for  reversal 
complexity.  We  demonstrated  that  deterministic  reversal  and  space  arc  polynomially 
related.  If  we  accept  that  reversal  is  parallel  time,  then  we  have  yet  another  evidence  of 
the  so-called  'Parallel  Computation  Thesis'.  We  suggest  some  directions  for  the  further 
research. 

(1)      Can  our  result  that 

DREVERSAL(r)  C  DREVERSAL-.(r') 
be  improved  to  either 

DREVERSAL{r)  Q  DREVERSAL^(r^)  (•) 

or  to 
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DREVERSALir)  C  DREVERSAL^{r)  (••) 

for  some  constants  k  and  h  ?    We  believe  (*)  is  unlikely  to  be  true  and  (**)  seems 
to  require  new  techniques; 

(2)  The  problem  of  'speedup  reversal  complexity  by  a  constant  factor'  still  remains 
open.  We  know  that  there  are  languages  accepted  with  k  reversals  but  not  with  k-\ 
reversals.  So  any  linear  speedup  result  for  reversal  must  exclude  the  case  where  a 
constant  number  of  reversals  are  made. 
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